Speaker-and-Environment Change Detection in Broadcast News Using Maximum Divergence Common Component GMM
Abstract
In this paper, the supervised maximum-divergence common component GMM (MD-CCGMM) model is applied to speaker-and-environment change detection in broadcast news signals. To discriminate speaker-and-environment changes, the MD-CCGMM signal model simultaneously maximizes the likelihood of the CCGMM signal model and the divergence measure between different audio signal segments. The performance of the MD-CCGMM model was examined on a four-hour TV broadcast news database. Using the divergence measure of the CCGMM model yielded an Equal Error Rate (EER) of 16.0%; the supervised MD-CCGMM model reduced the EER to 14.6%.
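The abstract does not give the MD-CCGMM formulation, so the following is only a minimal sketch of the general idea behind divergence-based change detection and EER scoring. It assumes a single diagonal Gaussian per analysis window instead of a CCGMM, and the function names (`fit_diag_gaussian`, `symmetric_kl`, `change_scores`, `equal_error_rate`) are hypothetical, not taken from the paper.

```python
from typing import List, Sequence, Tuple

Frame = Sequence[float]


def fit_diag_gaussian(frames: Sequence[Frame]) -> Tuple[List[float], List[float]]:
    """Per-dimension mean and variance of a segment of feature frames."""
    n = len(frames)
    dim = len(frames[0])
    mean = [sum(f[d] for f in frames) / n for d in range(dim)]
    # Variance floor (assumed value) avoids division by zero on flat segments.
    var = [
        max(sum((f[d] - mean[d]) ** 2 for f in frames) / n, 1e-6)
        for d in range(dim)
    ]
    return mean, var


def symmetric_kl(seg_a: Sequence[Frame], seg_b: Sequence[Frame]) -> float:
    """Symmetric KL divergence, KL(a||b) + KL(b||a), between diagonal
    Gaussians fitted to two segments. A stand-in for the CCGMM divergence."""
    mean_a, var_a = fit_diag_gaussian(seg_a)
    mean_b, var_b = fit_diag_gaussian(seg_b)
    total = 0.0
    for ma, va, mb, vb in zip(mean_a, var_a, mean_b, var_b):
        diff2 = (ma - mb) ** 2
        total += 0.5 * (va / vb + vb / va - 2.0)
        total += 0.5 * diff2 * (1.0 / va + 1.0 / vb)
    return total


def change_scores(frames: Sequence[Frame], window: int) -> List[float]:
    """Divergence between the windows just before and just after each
    candidate boundary t = window, ..., len(frames) - window."""
    return [
        symmetric_kl(frames[t - window:t], frames[t:t + window])
        for t in range(window, len(frames) - window + 1)
    ]


def equal_error_rate(scores: Sequence[float], labels: Sequence[int]) -> float:
    """Approximate EER: sweep thresholds over the observed scores and return
    the operating point where miss rate and false-alarm rate are closest.
    Assumes labels contain at least one change (1) and one non-change (0)."""
    positives = sum(1 for y in labels if y)
    negatives = len(labels) - positives
    best_gap, eer = float("inf"), 1.0
    for threshold in sorted(set(scores)):
        misses = sum(1 for s, y in zip(scores, labels) if y and s < threshold)
        false_alarms = sum(
            1 for s, y in zip(scores, labels) if not y and s >= threshold
        )
        miss_rate = misses / positives
        fa_rate = false_alarms / negatives
        gap = abs(miss_rate - fa_rate)
        if gap < best_gap:
            best_gap, eer = gap, (miss_rate + fa_rate) / 2.0
    return eer
```

In such a sketch, a change point would be hypothesized wherever the score from `change_scores` exceeds a threshold, and sweeping that threshold against reference labels gives the EER. The 16.0% and 14.6% figures reported in the abstract come from the authors' CCGMM-based models, not from this simplified single-Gaussian stand-in.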
Similar resources
Speaker-and-environment change detection in broadcast news using the common component GMM-based divergence measure
In this paper, a GMM with common mixture components, referred to as the common component GMM (CCGMM), is proposed as the signal model for calculating the divergence measure for speaker-and-environment change detection in broadcast news signals. The GMM is used to increase the accuracy of audio signal modeling, while the common mixture components are used to solve the complexity problem o...
On-line incremental speaker adaptation with automatic speaker change detection
In order to improve the performance of speech recognition systems when speakers change frequently and each of them utters a series of several sentences, a new unsupervised, online and incremental speaker adaptation technique combined with automatic detection of speaker changes is proposed. The speaker change is detected by comparing likelihoods using speaker-independent and speaker-adaptive GMM...
Universal Background Models for Real-time Speaker Change Detection
This paper addresses the problem of real-time speaker change detection in TV news broadcasts, in which no prior knowledge of the speakers is assumed. To remove unreliable frames and background frames from the speech stream, we propose a new approach for feature categorization based on the Gaussian Mixture Model Universal Background Model (GMM-UBM). The feature vectors are categorized into three sets, ...
Error Detection in Broadcast News ASR Using Markov Chains
This article addresses error detection in broadcast news automatic transcription, as a post-processing stage. Based on the observation that many errors appear in bursts, we investigated the use of Markov Chains (MC) for their temporal modelling capabilities. Experiments were conducted on a large American English broadcast news corpus from NIST. Common features in error detection were used, all ...
Two step speaker segmentation method using Bayesian information criterion and adapted Gaussian mixtures models
This paper addresses the topic of online unsupervised speaker segmentation in a complex audio environment as it is present in the Broadcast News databases. A new two stage speaker change detection algorithm is proposed, which combines the Bayesian Information Criterion with an ABLS-SCD statistical framework where adapted Gaussian mixture models are used to achieve higher accuracy. To enhance th...
Publication date: 2006